Aspect-Driven News Summarization
Abstract
A summary of any event type is only complete if certain information aspects are mentioned. For a court trial, readers will at least want to know who is involved and what the charges and the sentence are. For a natural disaster, they will ask for the disaster type, the victims and other damages. Will a co-occurrence or frequency-based sentence extraction summariser automatically provide the requested information, or are the results better if an information extraction (IE) system first detects the summary-crucial aspects? To answer this question, we compared the performance of a purely co-occurrence-based method with a system that additionally makes use of targeted IE. As each event type requires different information aspects and not all of them were covered by the existing IE software, we used a tool that learns semantically related terms to cover the remaining aspects. The comprehensive evaluation in the TAC’2010 competition showed that event extraction is indeed beneficial for summarisation performance, and that summary quality is directly related to IE quality. Our integrated system was ranked among the top systems participating at TAC.
Similar resources
WHUSUM Participation at TAC 2011 Guided Summarization Track
In this report, we present details about the participation of WHUSUM in the guided summarization track at TAC 2011. The guided summarization task requires participants to produce short, coherent summaries of news articles with the guidance of predefined categories and aspects for each category. This year, we extended our query-focused update summarization system with aspect-related information. In ...
Summarization of Broadcast News Video through Link Analysis of Named Entities
This paper describes the use of connections between named entities for summarization of broadcast news. We first extract named entities from a transcript of a news story, and find related entities nearby. In the context of a query, a link graph of relevant entities is rendered in an interactive display, allowing the user to manipulate, browse and examine the components, including the ability to...
Entity-driven Rewrite for Multi-document Summarization
In this paper we explore the benefits from and shortcomings of entity-driven noun phrase rewriting for multidocument summarization of news. The approach leads to 20% to 50% different content in the summary in comparison to an extractive summary produced using the same underlying approach, showing the promise the technique has to offer. In addition, summaries produced using entity-driven rewrite...
Reader-Aware Multi-Document Summarization: An Enhanced Model and The First Dataset
We investigate the problem of reader-aware multi-document summarization (RA-MDS) and introduce a new dataset for this problem. To tackle RA-MDS, we extend a variational auto-encoder (VAE) based MDS framework by jointly considering news documents and reader comments. To evaluate summarization performance, we prepare a new dataset. We describe the methods for data collection, aspect...
A Platform for Multilingual News Summarization
We have developed a multilingual version of Columbia Newsblaster as a testbed for multilingual multi-document summarization. The system collects, clusters, and summarizes news documents from sources all over the world daily. It crawls news sites in many different countries, written in different languages, extracts the news text from the HTML pages, uses a variety of methods to translate the doc...
Analysis and Modeling of Manual Summarization of Japanese Broadcast News
We describe our analysis and modeling of the summarization process of Japanese broadcast news. We have studied the entire manual summarization process of the Japan Broadcasting Corporation (NHK). The staff of NHK has been making manual summarizations of news text on a daily basis since December 2000. We interviewed these professional abstractors and obtained a considerable amount of news summar...